Parallel Performance Evaluation of Sequence Nucleotide Alignment on the Supercomputer BlueGene/P
Abstract
Bioinformatics is a scientific area requiring powerful computing resources for exploring large sets of biological data. Sequence alignment is an important method in DNA and protein analysis. BLAST has become the most popular tool and implements a fast heuristic method for sequence alignment and searching. The goal of this paper is to estimate the scalability of parallel sequence alignment on the supercomputer BlueGene/P for the case study of investigating the interaction between influenza virus A and the host genome. The parallel performance evaluation of sequence alignment has been performed experimentally using the parallel mpiBLAST program implementation, on a local mirror database comprising the available isolates of influenza virus A and the human genome. The molecular biology outcome of the experiments is that the similarity between influenza virus A and the human genome has been determined.
Key-Words: Biocomputing, High Performance Computing, Human Genome, Influenza Virus, mpiBLAST, Parallel Performance, Sequence Alignment.
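Scalability of a parallel run is commonly summarized by speedup and parallel efficiency computed from measured wall-clock times. The sketch below is only an illustration of that arithmetic, not the authors' evaluation procedure: the function name `scaling_metrics` is hypothetical, and the timings are made-up placeholders, not results from the paper. Speedup is taken relative to the smallest core count measured, which is a common choice when a single-core baseline is impractical.

```python
from typing import Dict


def scaling_metrics(wall_times: Dict[int, float]) -> Dict[int, Dict[str, float]]:
    """Return relative speedup and parallel efficiency per core count.

    wall_times maps a core count to the measured wall-clock time in seconds.
    The run with the fewest cores is used as the baseline.
    """
    base_cores = min(wall_times)
    base_time = wall_times[base_cores]
    metrics = {}
    for cores in sorted(wall_times):
        # Relative speedup: how much faster than the baseline run.
        speedup = base_time / wall_times[cores]
        # Efficiency: speedup divided by the factor by which cores grew.
        efficiency = speedup * base_cores / cores
        metrics[cores] = {"speedup": speedup, "efficiency": efficiency}
    return metrics


# Placeholder timings (seconds), purely illustrative; NOT measurements from the paper.
example_times = {128: 1000.0, 256: 520.0, 512: 280.0}
for cores, m in scaling_metrics(example_times).items():
    print(f"{cores:5d} cores  speedup {m['speedup']:.2f}  efficiency {m['efficiency']:.2f}")
```

An efficiency close to 1 across core counts would indicate near-linear scaling; a steady decline suggests that communication or load imbalance is starting to dominate.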
Similar resources
Scaling of Parallel Software for Biological Sequences Alignment and Homology Search on the Supercomputer BlueGene/P
The goal of this paper is to propose the performance evaluation of the scaling of parallel software for biological sequence alignment and homology searching based on blast algorithm for sequence searching and clustalw algorithm for multiple sequence alignment on the supercomputer BlueGene/P for the case study of influenza virus sequences variability and homology searching with human genome.
Computational Challenges in Biological Sequence Processing and In-silico Molecular Biology Experiments
Biological sequence processing is a key of information technology for molecular biology. This scientific area requires powerful computing resources for exploring large sets of biological data. The huge amount of biological sequences accumulated in the world nucleotide and protein databases requires efficient parallel tools for structural genomic and functional analysis. The paper describes the ...
Computational Aspects of In-silico Experiments for Investigating the Impact of the Host Genome on the Influenza Virus A Variability
Nowadays the study of the variability of influenza virus is a problem of very great importance. Influenza type A viruses cause epidemics and pandemics. The problem of restricting the spreading of pandemics and the treatment of the people infected by the influenza virus is widely based on the latest achievements of molecular biology, bioinformatics and biocomputing, as well as many other advance...
Optimization of Multiple Sequence Alignment Software ClustalW
This activity with the project PRACE-2IP is aimed to investigate and improve the performance of multiple sequence alignment software ClustalW on the supercomputer BlueGene/Q, so-called JUQUEEN, for the case study o...
Available online at www.prace-ri.eu Partnership for Advanced Computing in Europe
In silico biological sequence processing is a key task in molecular biology. This scientific area requires powerful computing resources for exploring large sets of biological data. Parallel in silico simulations based on methods and algorithms for analysis of biological data using high-performance distributed computing is essential for accelerating the research and reducing the investment. Mult...
Publication date: 2011